Predicting accurate probabilities with a ranking loss

نویسندگان

  • Aditya Krishna Menon
  • Xiaoqian Jiang
  • Shankar Vembu
  • Charles Elkan
  • Lucila Ohno-Machado
چکیده

In many real-world applications of machine learning classifiers, it is essential to predict the probability of an example belonging to a particular class. This paper proposes a simple technique for predicting probabilities based on optimizing a ranking loss, followed by isotonic regression. This semi-parametric technique offers both good ranking and regression performance, and models a richer set of probability distributions than statistical workhorses such as logistic regression. We provide experimental results that show the effectiveness of this technique on real-world applications of probability prediction.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Minimizing the Position Error in Label Ranking

Conventional classification learning allows a classifier to make a one shot decision in order to identify the correct label. However, in many practical applications, the problem is not to give a single estimation, but to make repeated suggestions until the correct target label has been identified. Thus, the learner has to deliver a label ranking, that is, a ranking of all possible alternatives....

متن کامل

Situational Assessments Based on Uncertainty-Risk Awareness in Complex Traffic Scenarios

Situational assessment (SA) is one of the key parts for the application of intelligent alternative-energy vehicles (IAVs) in the sustainable transportation. It helps IAVs understand and comprehend traffic environments better. In SA, it is crucial to be aware of uncertainty-risks, such as sensor failure or communication loss. The objective of this study is to assess traffic situations considerin...

متن کامل

Modeling Basketball’s Points per Possession With Application to Predicting the Outcome of College Basketball Games

In this paper we consider how to model basketball's points per possession data, and we show that the flexibility provided by a multinomial logistic regression is required for modeling this type of data. We show how to apply this model to ranking college basketball teams, and a method for estimating team win probabilities with this model is provided. We show how to use these win probabilities to...

متن کامل

Fast Unsupervised Automobile Insurance Fraud Detection Based on Spectral Ranking of Anomalies

Collecting insurance fraud samples is costly and if performed manually is very time consuming. This issue suggests usage of unsupervised models. One of the accurate methods in this regards is Spectral Ranking of Anomalies (SRA) that is shown to work better than other methods for auto insurance fraud detection specifically. However, this approach is not scalable to large samples and is not appro...

متن کامل

Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels

BACKGROUND Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed application to high-density genotype data on all individuals, which may not be the case in practi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning

دوره 2012  شماره 

صفحات  -

تاریخ انتشار 2012